Enriched technology-enabled annotation and analyses of child speech
Abstract
This paper reviews a range of studies illustrating the ways in which speech technology has enabled richer analyses of corpora of young children’s speech and infants’ speech-like vocalizations. One set of studies illustrates the use of speech synthesis technology, such as VLAM, an articulatory synthesis system that models the transfer functions of child- or infant-proportioned vocal tracts. VLAM has been used to evaluate cross-language differences in the emergence of contrasting vowel categories. Another set of studies illustrates the use of modern corpus development and annotation tools to create and analyze the paidologos corpus, a database of utterances elicited in a picture-prompted word-repetition task from 2- through 5-year-old child speakers of a variety of languages. Flexible, incremental annotation on multiple tiers allows researchers to extract target sounds for spectral analysis as well as to calculate transcribed accuracy rates. Tag sets can also be used to extract stimuli for perception experiments, yielding naive-listener responses that can become another layer of tags.
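As a rough illustration of how tiered annotation can feed a transcribed accuracy rate, the sketch below pairs a tier of target sounds with a tier of transcribed sounds by time overlap and computes the proportion of matches per target. The `Interval` class, the function names, and the sample intervals are all hypothetical and are not taken from the paidologos corpus or its tools.

```python
from dataclasses import dataclass


@dataclass
class Interval:
    """One labeled span on an annotation tier (times in seconds)."""
    start: float
    end: float
    label: str


def overlapping(tier, start, end):
    """Return the intervals on `tier` that overlap the span [start, end)."""
    return [iv for iv in tier if iv.start < end and iv.end > start]


def accuracy_by_target(target_tier, transcription_tier):
    """Proportion of tokens per target sound whose transcription matches.

    Assumption: the first overlapping transcription interval is the one
    that corresponds to the target token.
    """
    counts = {}
    for tgt in target_tier:
        hits = overlapping(transcription_tier, tgt.start, tgt.end)
        if not hits:
            continue  # token not yet transcribed; skip it
        correct = hits[0].label == tgt.label
        n_ok, n_all = counts.get(tgt.label, (0, 0))
        counts[tgt.label] = (n_ok + int(correct), n_all + 1)
    return {label: ok / total for label, (ok, total) in counts.items()}


# Made-up example tiers: three target tokens and their transcriptions.
targets = [
    Interval(0.10, 0.18, "s"),
    Interval(1.20, 1.31, "s"),
    Interval(2.05, 2.12, "k"),
]
transcribed = [
    Interval(0.10, 0.18, "s"),
    Interval(1.20, 1.31, "t"),  # target "s" transcribed as "t"
    Interval(2.05, 2.12, "k"),
]

rates = accuracy_by_target(targets, transcribed)
print(rates)
```

Because the pairing relies only on time overlap, further tiers (for example, naive-listener responses) could be added as additional lists of intervals without changing the existing ones.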
Similar resources
Identification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Network Analysis
Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
Prosodically Enriched Text Annotation for High Quality Speech Synthesis
Linguistically enriched text generated from natural language modules contributes significantly to the quality of speech synthesis. For all cases where such modules are not available, such enriched input needs to be produced from plain text in order to maintain quality. This work reports on a framework of several combined language resources and procedures (word/sentence identification, syntactic...
A New Workflow for Semi-Automatized Annotations: Tests with Long-Form Naturalistic Recordings of Children's Language Environments
Interoperable annotation formats are fundamental to the utility, expansion, and sustainability of collective data repositories. In language development research, shared annotation schemes have been critical to facilitating the transition from raw acoustic data to searchable, structured corpora. Current schemes typically require comprehensive and manual annotation of utterance boundaries and ort...
Studying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
Publication date: 2013